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The ATP-binding cassette (ABQ transporter superfamily contains membrane proteins that translocate a variety 
of substrates across extra- and intracellular membranes. Genetic variation in these genes is the cause of or 
contributor to a wide variety of human disorders with Mendelian and complex inheritance, including cystic 
fibrosis, neurological disease, retinal degeneration, cholesterol and bile transport defects, anemia, and drug 
response. Conservation of the ATP-binding domains of these genes has allowed the identification of new 
members of the superfamily based on nucleotide and protein sequence homology. Phylogenetic analysis is used 
to divide all 43 known ABC transporters into seven distinct subfamilies of proteins. For each gene, the precise 
map location on human chromosomes, expression data, and localization within the superfamily has been 
determined. These data allow predictions to be made as to potential functions or disease phenotypes associated 
with each protein. In this paper, we review the current state of knowledge on all human ABC genes in inherited 
disease and drug resistance. In addition, the availability of the complete DrosopMk genome sequence allows the 
comparison of the known human ABC genes with those in the fly genome. The combined data enable an 
evolutionary analysis of the superfamily. Complete characterization of all ABC from the human genome and 
from model organisms will lead to important insights into the physiology and the molecular basis of many 
human disorders. 



ABC Protein and Gene Organization 
The ABC genes represent the largest family of transmembrane 
proteins. These proteins bind ATP and use the energy to drive 
the transport of various molecules across all cell membranes 
(Higgins 1992; Childs and Ling 1994; Dean and Allikmets 
1995). Proteins are classified as ABC transporters based on the 
sequence and organization of their ATP-binding domains, 
also known as nucleotide-binding folds NBFs. The NBFs con- 
tain characteristic motifs, Walker A and B, separated by -90- 
120 amino acids, found in all ATP-binding proteins (Fig. 1). 
ABC genes also contain an additional element, the signature 
C motif, located just upstream of the Walker B site (Hyde et al. 
1990). The functional protein typically contains two NBFs 
and two transmembrane (TM) domains (Fig. 1). The TM do- 
mains contain 6-11 membrane-spanning a-helices and pro- 
vide the specificity for the substrate. The NBFs are located in 
the cytoplasm and transfer the energy to transport the sub- 
strate across the membrane. ABC pumps are mostly unidirec- 
tional. In bacteria, they are predominantly involved in the 
import of essential compounds that cannot be obtained by 
diffusion (e.g., sugars, vitamins, metal ions, etc.) into the cell. 
In eukaryotes, most ABC genes move compounds from the 
cytoplasm to the outside of the cell or into an extracellular 
compartment (endoplasmic reticulum, mitochondria, peroxi- 
some). Most of the known functions of eukaryotic ABC trans- 
porters involve the shuttling of hydrophobic compounds ei- 
ther within the cell as part of a metabolic process or outside 
the cell for transport to other organs, or secretion from the 
body. 
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The eukaryotic ABC genes are organized either as full 
transporters containing two TMs and two NBFs, or as half 
transporters (Hyde et al. 1990). The latter must form either 
homodimers or heterodimers to constitute a functional trans- 
porter. ABC genes are dispersed widely in eukaryotic genomes 
and are highly conserved between species, indicating that 
most of these genes have existed since the beginning of eu- 
karyotic evolution. The genes can be divided into subfamilies 
based on similarity in gene structure (half vs. full transport- 
ers), order of the domains, and on sequence homology in the 
NBF and TM domains. There are seven mammalian ABC gene 
subfamilies, five of which are found in the Saccharomyces cer- 
evisiae genome. 

Overview of Human ABC Gene Subfamilies 
A list of all known human ABC genes is displayed in Table 1. 
This list includes an analysis of recently released genome se- 
quences (Lander et al. 2001; Venter et al. 2001). Several se- 
quences remain in the genome with homology to ABC genes, 
many of which represent pseudogenes. A comprehensive de- 
scription of all ABC genes goes beyond the scope of this re- 
view. Therefore, only a concise summary of each subfamily is 
provided. 

ABCA (ABQ) 

This subfamily comprises 12 full transporters (Table 1) that 
are further divided into two subgroups based on phylogenetic 
analysis and intron structure (Broccardo et al. 1999). The first 
group includes seven genes dispersed on six different chro- 
mosomes {ABCA1, ABCA2, ABCA3, ABCA4, ABCA7, ABCA12, 
ABCA13), whereas the second group contains five genes 
{ABCAS, ABCA6, ABCA8, ABCA9, and ABCA10) arranged in a 
cluster on chromosome 17q24. The ABCA subfamily contains 
some of the largest ABC genes, several of which are >2100 
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Figure 1 Diagram of a typical ABC transporter protein. (A) A dia- 
gram of the structure of a representative ABC protein is shown with a 
lipid bilayer in yellow, the transmembrane domains in blue, and the 
nucleotide binding fold in red. Although the most common arrange- 
ment is a full-transporter with motifs arranged N-TM-NBF-TM-NBF-C, 
as shown, NBF-TM-NBF-TM, TM-NBF, and NBF-TM arrangements are 
also found. (6) The NBF of an ABC gene contains the Walker A and B 
motifs found in all ATP-binding proteins. In addition, a signature or C 
motif is also present. The most common amino acids found in these 
motifs are shown above the diagram; subfamilies often contain char- 
acteristic residues in these and other regions. 



amino acids. Two members of this subfamily, the ABCA1 and 
ABCA4 (ABCR) proteins, have been studied extensively. The 
ABCA1 protein is involved in disorders of cholesterol trans- 
port and high-density lipoproteins (HDL) biosynthesis (see 
below). The ABCA4 protein transports vitamin A derivatives 
in the outer segments of photoreceptor cells and therefore 
performs a crucial step in the visual cycle. 

ABCB (MDR/TAP) 

The ABCB subfamily is unique in that it contains both full 
transporters and half transporters. Four full transporters and 
seven half transporters are currently identified as members of 
this subfamily. ABCB1 (MDR/PGY1) is the first human ABC 
transporter cloned and characterized through its ability to 
confer a multidrug resistance phenotype to cancer cells. The 
functional sites of ABCB1 include the blood-brain barrier and 
the liver. The ABCB4 and ABCB1 1 proteins are both located in 
the liver and are involved in the secretion of bile acids. The 
ABCB2 and ABCB3 (TAP) genes are half transporters that form 
a heterodimer to transport peptides into the endoplasmic re- 
ticulum, which are presented as antigens by the Class I HLA 
molecules. The closest homolog of the TAPs, the ABCB9 half 
transporter, has been localized to lysosomes. The remaining 
four half-transporters, ABCB6, ABCB7, ABCB8 and ABCB10 
localize to the mitochondria, where they function in iron 
metabolism and transport of Fe/S protein precursors. 

ABCC (CUR/MRP) 

The ABCC subfamily contains 12 full transporters with a di- 
verse functional spectrum that includes ion transport, cell 
surface receptor, and toxin secretion activities. The CFTR pro- 
tein is a chloride ion channel that has a role in all exocrine 
secretions, and mutations in CFTR cause cystic fibrosis (Quin- 
ton 1999). ABCC8 and ABCC9 proteins bind sulfonylurea and 
regulate potassium channels involved in modulating insulin 
secretion. The rest of the subfamily is composed of nine MRP- 
related genes. Of these ABCC1, ABCC2, and ABCC3 transport 
drug conjugates to glutathione and other organic anions. The 
ABCC4, ABCCS, ABCC11 and ABCC 12 proteins are smaller 
than the other MRPl-like genes and lack an animo-terminal 
domain (Borst et al. 2000) that is not essential for transport 



function (Bakos et al. 2000). The ABCC4 and ABCCS proteins 
confer resistance to nucleosides including PMEA and purine 
analogs. 

ABCD (ALD) 

The ABCD subfamily contains four genes in the human ge- 
nome and two each in the Drosophila and yeast genomes. The 
yeast PXA1 and PXA2 products dimerize to form a functional 
transporter involved in very long chain fatty acid oxidation in 
the peroxisome (Shani and Valle 1998). All of the genes en- 
code half transporters that are located in the peroxisome, 
where they function as homo- and/or heterodimers in the 
regulation of very long chain fatty acid transport. 

ABCE (OABP) and ABCF (GCN20) 

The ABCE and ABCF subfamilies contain genes that have ATP- 
binding domains that are clearly derived from ABC transport- 
ers but have no TM domain and are not known to be involved 
in any membrane transport functions. The ABCE subfamily is 
comprised solely of the oligo-adenylate binding protein, a 
molecule that recognizes oligo-adenylate that is produced in 
response to infection by certain viruses. This gene is found in 
multicellular eukaryotes, but not in yeast, suggesting it is part 
of innate immunity. Each ABCF gene contains a pair of NBFs. 
The best-characterized member, the 5. cerevisiae GCN20 gene 
mediates the activation of the eIF-2 a-kinase (Marton et al. 
1997) and a human homolog, ABCF1, is associated with the 
ribosome and appears to have a similar role (Tyzack et al 
2000). 

ABCC (White) 

The human ABCG subfamily is comprised of six 'reverse' half 
transporters that have an NBF at the amino terminus and a 
TM domain at the carboxyl terminus. The most intensively 
studied ABCG gene is the white locus of Drosophila. The white 
protein, along with brown and scarlet, transport precursors of 
eye pigments (guanine and tryptophan) in the eye cells of the 
fly (Chen et al. 1996). The mammalian ABCG1 gene is in- 
volved in cholesterol transport regulation (Klucken et al. 
2000). Other ABCG genes include ABCG2, a drug resistance 
gene; ABCGS and ABCG8, transporters of sterols in the intes- 
tine and liver; ABCG3, to date found exclusively in rodents; 
and the ABCG4 gene that is expressed predominantly in the 
liver. The functions of the last two genes are unknown. 

ABC Genes and Human Genetic Disease 
Many ABC genes were originally discovered during the posi- 
tional cloning of human genetic disease genes. To date, 14 
ABC genes have been linked to disorders displaying Mende- 
lian inheritance (Klein et al. 1999) (Table 2). As expected from 
the diverse functional roles of ABC genes, the genetic defi- 
ciencies that they cause also vary widely. Because ABC genes 
typically encode structural proteins, all of the disorders are 
recessive, and are attributable to a severe reduction or lack of 
function of the protein. Heterozygous variants in ABC gene 
mutations, however, are being implicated in the susceptibility 
to specific complex disorders. 

Cystic Fibrosis and CFTR 

Cystic fibrosis is the most common fatal childhood disease in 
Caucasian populations, reaching frequencies ranging from 1: 
900 to 1:2500. This corresponds to a carrier frequency of 1: 
15-1:25. The disease is much less common in African and 
Asian populations, where carrier frequencies of 1:100 to 1:200 
have been estimated. The disease frequency correlates with 
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Table 1. List of Human ABC Genes, Chromo somal Location, and Function 
Gene 



Alias 



Location Subfamily 



Expression 



Function 



ABCA1 

ABCA2 

ABCA3 

ABCA4 

ABCA5 

ABCA6 

ABCA7 

ABCA8 

ABCA9 

ABCA10 

ABCA12 

ABCA13 

ABCB1 

ABCB2 

ABCB3 

ABCB4 

ABCBS 

ABCB6 

ABCB7 

ABCB8 

ABCB9 

ABCB10 

ABCB11 

ABCC1 

ABCC2 

ABCC3 

ABCC4 

ABCC5 

ABCC6 

cm 

ABCC8 

ABCC9 

ABCC10 

ABCC1 1 

ABCC12 

ABCD1 

ABCD2 

ABCD3 

ABCD4 

ABCE1 

ABCF1 

ABCF2 

ABCF3 

ABCG1 

ABCC2 

ABCC4 

ABCCS 

ABCC8 



ABO 
ABC2 

ABC3, ABCC 
ABCR 



PGY1, MDR 
TAP1 
TAP2 
PCY3 

MTABC3 

ABC7 

MABC1 

MTABC2 

SPCP 

MRP1 

MRP2 

MRP3 

MRP4 

MRPS 

MRP6 

ABCC7 

SUR 

SUR2 

MRP7 

MRP8 

MRP9 

AID 

ALDL1, ALDR 
PXMP1, PMP70 
PMP69, P70R 
OABP, RNS41 
ABCSO 



ABC8, White 
ABCP, MXR, BCRP 
White2 
White3 



9q31.1 ABO Ubiquitous 

9q34 ABC1 Brain 

16p13.3 ABC1 Lung 

1 p22.1 -p21 ABC1 Photoreceptors 

17q24 ABC1 Muscle, heart, testes 

17q24 ABC1 Liver 

1 9pl 3.3 ABC1 Spleen, thymus 

17q24 ABC1 Ovary 

17q24 ABC1 Heart 

17q24 ABC1 Muscle, heart 

2q34 ABC1 Stomach 

7p11-ql1 ABC1 Low in all tissues 

7p21 MDR Adrenal, kidney, brain 

6p21 MDR All cells 

6p21 MDR All cells 

7q21.l MDR Liver 

7pl4 MDR Ubiquitous 

2q36 MDR Mitochondria 

Xql2-ql3 MDR Mitochondria 

7q36 MDR Mitochondria 

12q24 MDR Heart, brain 

lq42 MDR Mitochondria 

2q24 MDR Liver 

1 6pl 3.1 CF/MRP Lung, testes, PBMC 

10q24 CF/MRP Liver 

1 7q21 .3 CF/MRP Lung, intestine, liver 

13q32 CF/MRP Prostate 

3q27 CF/MRP Ubiquitous 

I6p13.l CF/MRP Kidney, liver 

7q3 1 .2 CF/MRP Exocrine tissue 

11 pi 5.1 CF/MRP Pancreas 

1 2p1 2. 1 CF/MRP Heart, muscle 

6p21 CF/MRP Low in all tissues 

1 6ql1 -ql 2 CF/MRP Low in all tissues 

1 6q1 1 -ql 2 CF/MRP Low in all tissues 

Xq28 ALD Peroxisomes 

12q11-ql2 ALD Peroxisomes 

1 p22-p21 ALD Peroxisomes 

14q24.3 ALD Peroxisomes 

4q31 OABP Ovary, testes, spleen 

6p21.33 GCIM20 Ubiquitous 

7q36 GCN20 Ubiquitous 

3q25 GCN20 Ubiquitous 

21q22.3 White Ubiquitous 

4q22 White Placenta, intestine 

11q23 White Liver 

2p21 White Liver, intestine 

2p21 White Liver, intestine 



Cholesterol efflux onto HDL 
Drug resistance 

N-retinylidene-PE efflux 



Multidrug resistance 
Peptide transport 
Peptide transport 
PC transport 

Iron transport 

Fe/S duster transport 



Bile salt transport 
Drug resistance 
Organic anion efflux 
Drug resistance 
Nucleoside transport 
Nucleoside transport 

Chloride ion channel 
Sulfonylurea receptor 



VLCFA transport regulation 



Oligoadenylate binding protein 



Cholesterol transport? 
Toxin efflux, drug resistance 

Sterol transport 
Sterol transport 



the frequency of the major allele of the CF gene, a deletion of 
three base pairs (AF508). At least two other populations, how- 
ever, have high frequency CF alleles. The W1282X allele is 
found on 51% of the alleles in the Ashkenazi Jewish popula- 
tion and the 1677delTA allele has been found at a high fre- 
quency in Georgians and is also present at elevated level in 
Turkish and Bulgarian populations. This has led several 
groups to hypothesize that these alleles arose through selec- 
tion of an advantageous phenotype in the heterozygotes. It is 
through CFTR that some bacterial toxins such as cholera and 
Escherichia coli cause increased fluid flow in the intestine and 
result in diarrhea. Therefore, several researchers have pro- 
posed that the CF mutations have been selected for in re- 
sponse to these diseases. This hypothesis is supported by stud- 
ies showing that CF homozygotes fail to secrete chloride ions 
in response to a variety of stimulants, and a study in mice in 
which heterozygous null animals showed reduced intestinal 



fluid secretion in response to cholera toxin (Gabriel et al. 
1993). CFTR is also the receptor for Salmonella typhimurium 
and implication in the innate immunity to Pseudomonas ae- 
ruginosa (Pier et al. 1998). 

Patients with two severe CFTR alleles like AF508 typically 
display severe diseases with inadequate secretion of pancre- 
atic enzymes leading to nutritional deficiencies, bacterial in- 
fections of the lung, and obstruction of the vas deferens, lead- 
ing to male infertility. Patients with at least one partially func- 
tional allele display enough residual pancreatic function to 
avoid the major nutritional and intestinal deficiencies (Dean 
et al. 1990) and subjects with very mild alleles display only 
congenital absence of the vas deferens with none of the other 
symptoms of CF. Recently, heterozygotes of CF mutations 
have been found to have an increased frequency of pancre- 
atitis (Cohn et al. 1998) and bronchiectasis (Pignatti et al. 
1995). Therefore, there is a spectrum of severity in the phe- 
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Table 2. 


Diseases and Phenotyes Caused by ABC Cenes 


ene 


Mendelian disorder 


complex disease 


ABCA1 


Tanaier disease FHni D 




ABCA4 


Stargardt/FFM, RP, CRD, CD 


AMD 


ABCB1 


Digoxin uptake 


ABCB2 


Immune deficiency 


ABCB3 


Immune deficiency 




ABCB4 


PFIC-3 


ICP 


ABCB7 


XLSA/A 


ABCB11 


PFIC-2 




ABCC2 


Dubin-Johnson Syndrome 




ABCC6 


Pseudoxanthoma elasticum 




ABCC7 . 


-Cystic Fibrosis, CBAVD 


•Pancreatitis, 






bronchiectasis 


ABCC8 


FPHHi 




ABCD1 


ALD 




ABCC5 


Sitosterolemia 




ABCC8 


Sitistero|emia 





FHDLD, familial hypoapoproteinemia; FFM, fundus flavimaculatis; 
RP, retinitis pigmentosum 19; CRD, cone-rod dystrophy; CD, 
xxxx; AMD, age-related macular degeneration; PFIC, progressive 
familial intrahepatic cholestasis; ICP, intrahepatic cholestasis of 
pregnancy; XLSA/A, X-linked sideroblasosis and anemia; FPHHI, 
familial persistent hyperinsulinemic hypoglycemia of infancy; 
ALD, adrenoleukodystrophy; IDDM, insulin-dependent diabetes 
mellitus. 



notypes caused by this gene that is inversely related to the 
level of CFTR activity. Clearly, other modifying genes and the 
environment also affect disease severity, particularly the pul- 
monary phenotypes. 

Adrenoleukodystrophy 

Adrenoleukodystrophy (ALD) is an X-linked recessive disorder 
characterized by neurodegenerative phenotypes with onset 
typically in late childhood (Mosser et al. 1993). Adrenal defi- 
ciency commonly occurs and the presentation of ALD is 
highly variable. AMN, childhood ALD and adult onset forms 
are recognized, but there is no apparent correlation to ABCD1 
alleles. ALD patients have an accumulation of unbranched, 
saturated fatty acids with a chain length of 24-30 carbons, in 
the cholesterol esters of the brain and in adrenal cortex. The 
ALD protein is located in the peroxisome, where it is believed 
to be involved in the transport of very long chain fatty acids. 

Sulfonylurea Receptor 

The ABCC8 gene is a high-affinity receptor for the drug sul- 
fonylurea. Sulfonylureas are a class of drugs widely used to 
increase insulin secretion in pa- 
tients with non-insulin-dependent 
diabetes. These drugs bind to the 
ABCC8 protein and inhibit an asso- 
ciated potassium channel. Familial 
persistent hyperinsulinemic hypo- 
glycemia of infancy is an autosomal 
recessive disorder in which subjects 
display unregulated insulin secre- 
tion. The disease was mapped to 
Ilpl5-pl4 by linkage analysis, and 
mutations in the ABCC8 gene are 
found in PHH1 families (Thomas et 
al. 1995). The ABCC8 gene has also 
been implicated in insulin response 



in Mexican-American subjects (Goksel et al. 1998) and in type 
II diabetes in French Canadians (Reis et al. 2000) but not in a 
Scandinavian cohort (Altshuler et al. 2000). 

Bile Salt Transport Disorders 

Several ABC transporters are specifically expressed in the liver, 
have a role in the secretion of components of the bile, and are 
responsible for several forms of progressive familial intrahe- 
patic cholestasis (PFIC). PFICs are a heterogeneous group of 
autosomal recessive liver disorders, characterized by early on- 
set of cholestasis that leads to liver cirrhosis and failure 
(Alonso et al. 1994). The ABCB4 (PGY3) gene transports phos- 
phatidylcholine across the canalicular membrane of hepato- 
cytes (van Helvoort et al. 1996). Mutations in this gene cause 
PFIC3 (Deleuze et al. 1996; de Vree et al. 1998) and are asso- 
ciated with intrahepatic cholestasis of pregnancy (Dixon et al. 
2000). The rat Abcc2 gene was found to have a frame-shift 
mutation in the strain defective in canalicular multispecific 
organic anion transport, the TR~ rat (Paulusma et al. 1996). 
The TR" rat is an animal model of Dubin-Johnson syndrome 
and mutations in ABCC2 have been identified in Dubin- 
Johnson syndrome patients (Wada et al. 1998). The ABCC2 
protein is expressed on the canalicular side of the hepatocyte 
and mediates organic anion transport. The ABCB1 1 gene was 
originally identified based on homology to ABCB1 (Childs et 
al. 1995). ABCB11 is highly expressed on the liver canalicular 
membrane and has been shown to be the major bile salt ex- 
port pump. Mutations in ABCB11 are found in patients with 
PFIC2 (Strautnieks et al. 1998). 

Retinal Degeneration andABCA4 

The ABCA4 gene is expressed exclusively in photoreceptors 
where it transports retinol (vitamin A) derivatives from the 
photoreceptor outer segment disks into the cytoplasm (Allik- 
mets et al. 1997). The chromophore of a visual pigment rho- 
dopsin, retinal, or conjugates with phospholipids are the 
likely substrates for ABCA4, as they stimulate the ATP hydro- 
lysis of the protein (Sun et al. 1999). Mice lacking Abca4 show 
increased all-trans-retinaldehyde (all-trans-RAL) following 
light exposure, elevated phosphatidylethanolamine (PE) in 
outer segments, accumulation of the protonated Schiff base 
complex of all-trans-RAL and PE (N-retinylidene-PE), and 
striking deposition of a major lipofusdn fluorophore (A2-E) in 
retinal pigment epithelium (RPE) (Weng et al. 1999). These 
data suggest that ABCR is an outwardly directed flippase for 
N-retinylidene-PE. 

Mutations in the ABCA4 gene have been associated with 
multiple eye disorders (Allikmets 2000). A complete loss of 



Table 3. ABC Transporters Involved In Drug Resistance 
Gene Substrates 



Inhibitors 



ABCB1 



ABCC1 
ABCC2 
ABCC3 
ABCC4 
ABCCS 

ABCC2 



Colchicine, doxorubicin, VP1 6, 
adriamycin, vinblastine, digoxin, 
saquinavir, paclitaxel 
Doxorubicin, daunorubtcin, vincristine, 
VP1 6, colchicines, etoposide, rhodamine 
Vinblastine, sulfinpyrazone, 
Methotrexate, etoposide 
Nucleoside monophosphates 
Nucleoside monophosphates 
Mitoxantrone, topotecan, doxorubicin, 
daunorubicin, CPT-11, rhodamine 



Verapamil, PSC833, CC918, 
V-104, PluronicL61 

Cyclosporin A, V-l 04 



Fumitremorgin C, GF120918 
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ABCA4 function leads to retinitis pigmentosa, whereas pa- 
tients with at least one missense allele have Startgardt disease 
(STGD). STGD is characterized by juvenile to early adult onset 
macular dystrophy with loss of central vision. ABCA4 muta- 
tion carriers are also increased in frequency in age-related 
macular degeneration (AMD) patients. AMD patients display 
a variety of phenotypic features, including the loss of central 
vision after the age of 60. The causes of this complex trait are 
poorly understood, but a combination of genetic and envi- 
ronmental factors have a role. The abnormal accumulation of 
retinoids, caused by ABCA4 deficiency has been postulated to 
be one mechanism by which this process could be initiated. 
Defects in ABCA4 lead to an accumulation of retinal deriva- 
tives in the retinal pigment epithelium behind the retina! 

Mitochondrial Iron Homeostasis 

Several half transporters of the' MDR/TAP subfamily have 
been localized to the inner membrane of the mitochondria. 
The yeast ortholog of ABCB7, Atml, has been implicated in 
mitochondrial iron homeostasis, as a transporter in the bio- 
genesis of cytosolic Fe/S proteins (Kispal et al. 1997). Two 
distinct missense mutations in ABCB7 are associated with the 
X-linked sideroblastic anemia and ataxia (XLSA/A) phenotype 
(Allikmets et al. 1999). Three more half transporters from this 
subfamily, ABCB6, ABCB8 and ABCB10 have also been local- 
ized to mitochondria (Table 1). 

Sterol Transport Deficiencies 

Tangier disease is characterized by deficient efflux of lipids 
from peripheral cells, such as macrophages, and a very low 
level of HDL. The disease is caused by alterations in the 
ABCA1 gene, implicating this protein in the pathway of re- 
moval of cholesterol and phospholipids onto HDL (Young 
and Fielding 1999). Patients with hypolipidemia have also 
been described that are heterozygous for ABCA1 mutations, 
suggesting that ABCA1 variations may have a role in regulat- 
ing the level of HDLs in the blood (Marcil et al. 1999). 

Subsequently, the sterol-dependent regulation of ABCA1 
expression was shown (Langmann et al. 1999). Current mod- 
els for ABCA1 function place it at the plasma membrane 
where it mediates the transfer of phospholipid and choles- 
terol onto lipid-poor apolipoproteins to form nascent HDL 
particles. The ABCAl-mediated efflux of cholesterol is regu- 
lated by nuclear hormone receptors, such as oxysterol recep- 
tors (LXRs) and the bile acid receptor (FXR), as heterodimers 
with retinoid X receptors (RXRs) (Repa et al. 2000). 

Recently, two half-transporter genes, ABCGS and ABCG8 
were characterized (Berge et al. 2000; Lee et al. 2001), located 
head-to-head on the human chromosome 2pl5-pl6, and 
regulated by the same promoter. These genes are both mu- 
tated in families with sitosterolemia, a disorder characterized 
by defective transport of plant and fish sterols and choles- 
terol. Most likely, the two half-transporters form a functional 
heterodimer. The ABCG1 gene is also regulated by cholesterol 
(Klucken et a!. 2000) and ABCG4 is highly expressed in the 
liver, suggesting that these two genes may also be involved in 
cholesterol transport (Table 1). 

Multidrug Resistance 

Cells exposed to toxic compounds can develop resistance by a 
number of mechanisms including decreased uptake, increased 
detoxification, alteration of target proteins, or increased ex- 
cretion. Several of these pathways can lead to multidrug re- 
sistance (MDR), in which the cell is resistant to several drugs 



in addition to the initial compound. This is a particular limi- 
tation to cancer chemotherapy and the MDR cell often dis- 
plays other properties, such as genome instability and loss of 
checkpoint control, which complicate further therapy. ABC 
genes have an important role in MDR and at least six genes are 
associated with drug transport (Table 3). 

ABCBI 

The best characterized ABC drug pump is the ABCBI gene, 
formerly known as MDR1 or PGY1. ABCBI was the first hu- 
man ABC transporter cloned and characterized through its 
ability to confer a multidrug resistance phenotype to cancer 
cells that had developed resistance to chemotherapy drugs 
(Juliano and Ling 1976). ABCBI has been demonstrated to be 
a promiscuous transporter of hydrophobic substrates, hydro- 
phobic drugs including drugs including colchicine, VP16, 
adriamycin and vinblastine as well as lipids, steroids, xenobi- 
otics, and peptides (for review, see Ambudkar 1998). The gene 
is thought to have an important role in removing toxic me- 
tabolites from cells, but is also expressed in cells at the blood- 
brain barrier and presumably has a role in transporting com- 
pounds into the brain that cannot be delivered by diffusion. 
ABCBI also affects the pharmacology of the drugs that are 
substrates and a common polymorphism in the gene affects 
digoxin uptake (Hoffmeyer et al. 2000). 

ABCCI 

The ABCCI gene was identified in the small-cell lung carci- 
noma cell line NCI-H69, a multidrug resistant cell that did not 
overexpress ABCBI (Cole et al. 1992). The ABCCI pump con- 
fers resistance to doxorubicin, daunorubicin, vincristine, col- 
chicines and several other compounds, a very similar profile 
to that of ABCBI. Unlike ABCBI, however, ABCCI transports 
drugs that are conjugated to glutathione by the glutathione 
reductase pathway (Borst et al. 2000). ABCCI can also trans- 
port leukotrienes, such as leukotriene C4 (LTC4). LTC4 is an 
important signaling molecule for the migration of dendritic 
cells. Migration of dendritic cells from the epidermis to lym- 
phatic vessels is defective in Abccl' 1 ' mice (Robbiani et al 
2000). 

ABCG2 

Analysis of cell lines resistant to mitoxantrone that do not 
overexpress ABCBI or ABCCI, led several laboratories to iden- 
tify the ABCG2 {ABCP, MXR1, BCRF) gene as a drug trans- 
porter (Allikmets et al. 1998; Doyle et al. 1998; Miyake et al. 
1999). ABCG2 confers resistance to anthracycline anticancer 
drugs and is amplified or involved in chromosomal translo- 
cations in cell lines selected with topotecan, mitoxantrone, or 
doxorubicin treatment. It is suspected that ABCG2 functions 
as a homodimer because tranfection of the gene into cells 
confers resistance to chemotherapeutic drugs. ABCG2 can 
also transport several dyes such as rhodamine and Hoechst 
33462 and the gene is highly expressed in a subpopulation of 
hematopoetic stem cells (side population) that stain poorly 
for these dyes. The normal function of the gene in these cells, 
however, is unknown. ABCG2 is highly expressed in the tro- 
phoblast cells of the placenta. This suggests that the pump is 
responsible either for transporting compounds into the fetal 
blood supply, or removing toxic metabolites. The gene is also 
expressed in the intestine and ABCG2 inhibitors that could be 
useful in making substrates orally available. 
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Phylogenetic Analysis of Human ABC Genes 
The identification of the nearly complete set of human ABC 
genes allows a comprehensive phylogenetic analysis of the 
superfamily. To understand the organization of these genes 
within the subfamilies described previously, an alignment of 
the ATP-binding domains was generated and used for phylo- 
genetic analysis. Figure 2 displays a 
neighbor-joining tree resulting 
from this analysis. The proposed 
nomenclature of ABC transporters 
is in excellent agreement with the 
phylogenetic trees obtained. In par- 
ticular, all major ABC transporter, 
families are represented in the hu- 
man tree by stable clusters with 
high bootstrap values. 

This analysis provides compel- 
ling evidence for frequent domain 
duplication of ATP-binding do- 
mains in ABC transporters. Virtu- 
ally invariably, both ATP-binding 
domains within a gene are more 



closely related to each other than to ATP-binding domains 
from ABC transporter genes of other subfamilies. This could 
be explained by a concerted evolution of domains within the 
same gene, but this seems unlikely because the two domains 
within each gene are substantially diverged. Therefore, it ap- 
pears that duplication of ATP-binding domains within major 



Figure 2 Phylogenetic tree of the hu- 
man ABC genes. ATP-binding domain 
proteins were identified using the model 
ABC_tran (accession PF00005) of the 
pfam database (Bateman et al. 1999). 
The hmmsearch program from the 
HMMER package (Eddy 1998) and a set 
of custom-made service scripts were 
used to extract ATP-binding domains 
from all protein sequences of interest. 
Note that some proteins analyzed con- 
tain two ATP-binding domains (denoted 
on the figures as I and II), whereas oth- 
ers contained onry one ATP-binding do- 
main. Alignments were generated with 
the hidden Markov mode! (Eddy 1995) 
based on the hmmalign program using 
the ABC_tran model. The resulting mul- 
tiple alignment was analyzed with 
NJBOOT (N. Takezaki, pers. comm.) 
implementing the neighbor-joining 
tree-making algorithm (Saitou and Nei 
1987) and the number at the branch of 
the nodes represents the value from 1 00 
replications. The distance measure be- 
tween sequences used for tree making 
was the Poisson correction for multiple 
hits (Zuckerkandl and Pauling 1 965). To 
verify the position of the previously un- 
known subgroup of Drosophila genes 
(CG6162, CC6162, and CC 7 7 147), they 
were aligned with a representative of 
each of the human subfamilies. Because 
some of the human proteins had two 
ATP-binding domains, the set contained 
three Drosophila and 12 human se- 
quences. The jTT (Jones et al. 1992) 
model as defined in the MOLPHY pack- 
age with the 'star decomposition' op- 
tion was employed. The tentative best ' 
tree (the total number of possible trees 
for 15 sequences is too large for exhaus- 
tive search through all these trees) was 
then used for local maximum likelihood 
search through the surrounding tree to- 
pologies. 
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ABC families was attributable to several independent duplica- 
tion events rather than a single ancestral duplication. 

Drosophila ABC genes 

To begin to understand the organization and evolution of the 
Drosophila ABC genes, the Celera (Myers et al. 2000) and Fly- 
Base databases were searched for sequences by a combination 
of blast searches and analysis of the annotation already pres- 
ent in the databases. Initial subfamily classifications were as- 
signed based on homology and blast scores, and the loca- 



tion of each gene was recorded (Table 4). In total, 56 genes 
were identified and there is at least one representative of each 
of the known mammalian subfamilies (Table 5). To confirm 
the subfamily groupings, the ATP-binding domain amino acid 
sequences were used to perform phylogenetic analyses. A rep- 
resentative tree is shown in Figure 3. As expected, genes from 
the same subfamily cluster together and confirm the initial 
assignments made by inspection. 

As in the human and yeast genomes, the Drosophila ABC 
genes are largely dispersed in the genome. There are four clus- 



Table 4. Drosophila ABC Genes 



Cene 



Alias Protein acc. DNA acc. Size <AA), FAMILY Location (chr. nuc) Cyto. loc 



Mutants 



CG3156 

CC2759 

CGI 703 

CGI 824 

CC9281 

CG8473 

CGI 2703 

CG1819 

CG1718 

CG1801 

CGI 494 

CG3164 

CG4822 

CGI 7646 

CG9892 

CC9664 

CG9663 

CG3327 

CG2969 

CG11147 

CG7806 

CG7627 

CGS853 

CGS772 

CG6214 

CG7491 

CGI 7338 

CGI 0441 

CG9270 

CG8799 

CG3879 

CG8523 

CG8908 

CG1050S 

CG 17632 

CG79SS 

CGI 0226 

Mdr6S 

CGS6S1 

CG7346 

CG4314 

CGS944 

CG60S2 

CG9330 

CGU709 

CG422S 

CG4S62 

CG4794 

CGS789 

CGI 8633 

CG1 1069 

CG6162 

CG9990 

CG1 1898 

CG11897 

CG2316 



AAF45509 
w AAF45826 
AAF48069 
AAF481 77 
AAF48493 
AAF48511 
AE003513 
AAF50847 
AAF50837 
AAF50836 
AAF50838 
AAF51548 
AAF51551 
AAF51341 
AAF51223 
AAF51131 
AAF51130 
AAF511 22 
Atet AAF51027 
AAF52284 
AAF52639 
AAF52648 
AAF52835 
Sur AAF52866 
AAF53223 
AAF53328 
AAF53736 
AAF53737 
AAF53950 
AAF58947 
Mdr49 AAF58437 
MdrSO AAF58271 
AAF57490 
AAF46706 
bw AAF47020 
AAF47526 
AAF50670 
AAF50669 
AAF50342 
AAF5O035 
st AAF49455 
AAF49305 
AAF49312 
AAF49142 
AAF54656 
AAF55241 
AAF55707 
AAF55726 
AAF56312 
AAF56360 
AAF56361 
AAF56584 
AAF56807 
AAF56870 
AAF56869 
AAF59367 



AE00341 7 609 B X 252038-254671 

AE003425 696 C X 2545753-2539884 

AE003486 901 E X 11 393813-1 1396731 

AE003489 761 B X 12363742-12360802 

AE003500 611 E X 15454374-15450765 

AE003500 2556 A X 1 551 3659-1 5523896 

AE003513 618 D X 19494615-19497465 

AE003569 1500A X 20757531-20763638 

AE003568 1 71 3 A X 20909795-20902146 

AE0Q3568 151 1 A X 20924492-2091 7580 

AE003568 1 197 A X 20896205-20901578 

AE003590 620 C 2L 1 23902-1 1 7541 

AE003590 643 C 2L 11 2000-1 1 6000 

AE003585 627 C 2L 1 720498-1 727693 

AE003582 61 5 C 2L 2649300-2658596 

AE003580 609 C 2L 321 1 844-3209624 

AE003580 812 G 2L 3214000-3220000 

AE003580 729 C 2L 3257267-325948 

AE003576 832 C 2L 4251 81 3-4262480 

AE00361 1 705 H 2L 5656028-5653232 

AE003620 1487 C 2L 8212839-8218079 

AE003620 1327C 2L 8262316^8256791 

AE003626 689 C 2L 98541 1 9-9847658 

AE003627 2250 C 2L 1 01 05357-1 0089272 

AE003637 1896 C 2L 12619174-12641593 

AE003641 324 A 2L 1 3675599-1 3676775 

AE003661 1 275 B 2L 1 8829742-1 8834099 

AE003661 1307 B 2L 18835157-18839979 

AE003668 1014 C 2L 20741 821-2073831 7 

AE003833 1 344 C 2R 4426560-4431 236 

AE003820 1279 B 2R 7940090-7934079 

AE003815 1313 B 2R 9235904-9241222 

AE003792 1 382 A 2R 1 5203694-1 5208725 

AE003453 1283 C 2R 1 6226805-1 611269% 

AE003461 755 C 2R 1 8476505-1 8465883 

AE003472 606 B 3L 1597621-1602155 

AE003563 1320 B 3L 6180561-6175400 

AE0O3563 1302 B 3L 6186691-6181468 

AE003S53 611 E 318895129-8892720 

AE003544 597 C 3L 1 1 555624-1 1 559309 

AE003527 666 G 3L 1 6398050-1 640071 5 

AE003522 1463 A 3L 1 7695681-1 7689489 

AE003523 1 660 A 3L 1 7627439-1 7622025 

AE003516 708 E 3L 1971540-1947231 

AE003692 1 307 C 3R 7362645-7369141 

AE00371 0 866 B 3R 1 1 61 5803-1 1 61 2420 

AE003728 1 348 C 3R 1 5626899-1 561 9809 

AE003728 711 A 3R 15725586-15728807 

AE003748 1239 C 3R 29281221-20277309 

AE003749 702 C 3R 29625526-29622829 

AE003749 602 G 3R 206351 34-20637920 

AE003756 535 H 3R 22087630-2208841 7 

AE003766 808 H 3R 2440961 3-24429503 

AE003768 1 302 C 3R 24887241-24892598 

AE003768 1 346 C 3R 24881 629-24885998 

AE003844 730 D 4 1 54260-1 45 1 46 



1B4 

3B4 

10C10 

11B16 

13E14 

13E18-F1 

18F1-F2 

19F1 

19F2 

19F2 

19F2 

21 B 

21B 

22B3 

23A6 

23E4-23E5 

23E4-23E5 

23F 

24F8 

26A1 

29A3-A4 

29B1 

30E1-30E3 

31A2 

33F2 

34D1 

37B9 

37B9 

39A2 

45D1 

49E1 

50F1 

56F11 

57D2 

59E3 

62B1 

65A14 

65A14 

66E3-E4 

68C10-C11 

73A3 

74E3-E4 

74E3-E4 

76B6 

86F1 

89A11-A12 

92B9 

92C1 

96A7 

96B5 

96B6 

97B1 

98F1 

99A 

99A 

101F 



bth 

fir, ms, mit(1)20 



pre, MR 



vin, cin, rose 



fs, I, aor, 
mar, mfs 

ird15, smi97B, Spn-D 
spg, lethal 



Sen, 5 lethals 
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Table 5. 


ABC Gene Subfamilies in 






Characterized Eukaryotes 






Subfamily 


Human 


Drosophila 


Yeast 


A 


12 


19 


0 


B 


11 


10 


4 


C 


12 


12 


7 


D 


4 


2 


2 


E 


1 


1 


0 


F 


3 


3 


5 


C 


5 


15 


10 


H 


0 


3 


0 


Other 


0 


0 


3* 


Total 


48 


56 


31 



•Includes YDR091c, YFL028C, and YDR061w. 



ters of two genes and one cluster of four genes (Table 4). One 
of these clusters (on chromosome 2L, band 37B9) is composed 
of an ABCB and an ABCC gene indicating that this is a chance 
grouping of genes. The remaining clusters are composed of 
genes from the same subfamily and arranged in a head-to-tail 
fashion consistent with gene duplication. Because the clusters 
are themselves dispersed and involve different subfamilies, 
they presumably represent independent gene duplication 
events. 

The best-studied Drosophila ABC genes are the eye pig- 
ment precursor transporters white, scarlet, and brown (w, st, 
and br, respectively). These genes are part of the ABCG sub- 
family and have a unique NBF-TM organization. Surprisingly, 
there are 15 ABCG genes in the fly genome, making this the 
most abundant ABC subfamily. This is in sharp contrast to the 
only five and six known ABCG genes in the human and 
mouse genomes, respectively. The Drosophila ABCG genes are 
highly dispersed in the genome with only two pairs of linked 
genes. In addition, they are quite divergent phylogenetically, 
suggesting that there were many independent and ancient 
gene duplication events. 

Several Drosophila ABCB genes, Mdr49 t MdrSO, and 
Mdr6$ t have also been well characterized. A fourth member of 
this group, CG10226, was identified that is clustered with 
Udx6S (Table 4). These genes are closely related to, the human 
and mouse P-glycoproteins (ABCB1, ABCB4) and disruption 
of Mdr49 results in sensitivity to colchicines (Wu et al. 1991). 

To search for potential gene functions, each region con- 
taining a Drosophila ABC gene was searched for phenotypes 
that have been not assigned to a gene (Table 4). The most 
promising connection is the identification of several eye phe- 
notypes (vin, rose, and cln) in the region of the CG7346 gene. 
Because CG7346 is part of the ABCG family and is therefore 
related to w, st, and br, it is tempting to speculate that muta- 
tions in CG7346 cause one or more of these phenotypes. Be- 
cause ABC genes perform very diverse functions and are asso- 
ciated with varied phenotypes it is hard to gather much ad- 
ditional insight from this analysis. 

Three genes, CG9990, CG6162, and CGI 1147, were iden- 
tified that do not fit into any of the known subfamilies and, 
in fact, are most closely related to ABC genes from bacteria. 
There are no close homologs to these genes in any other eu- 
karyotic genome including worm and plants. These genes are 
within large contiguous sequences and have introns, there- 
fore they do not represent contamination from bacterial se- 
quences. This group forms a distinct cluster on the Drosophila 
tree. To elucidate the position of this group with respect to 



human ABC transporter families, we produced two datasets 
with nine and 15 sequences, respectively, containing ATP- 
binding domains from the three Drosophila genes and from 
one representative from each of the human families. In the 
smaller subset, only one ATP-binding domain from each hu- 
man gene was used. These two datasets were subjected to 
maximum likelihood analysis, with a heuristic algorithm for 
the larger dataset and with a rigorous exhaustive search for 
the smaller datasets. The result of the analysis clearly indicates 
that the new subfamily of ABC transporter in Drosophila is 
significantly different from all known families of ABC trans- 
porters (data not shown) and might have a yet unidentified 
functional role. We propose designating this new group of 
genes as subfamily H. DNA sequence searches showed that 
although significant homology is present between these three 
genes in both the NBF and TM regions, there is no homology 
in the TM regions with any other eukaryotic or prokaryotic 
ABC proteins. The closest related NBF sequences are all bac- 
terial, such as the £. coli YHIH gene, a ribosomal ATPase. This 
gene shows 36% amino acid identity in the NBF domain. 

The ATP-binding domains in a generic ABC transporter 
appear to be following a birth-and-death process that has 
been described before for other multigene families. In a typi- 
cal birth-and-death process (Ota and Nei 1994; Nei et al. 
1997), repeated genetic segments (in this case domains) ex- 
perience nearly random fluctuation in their number. That is, 
genomic deletions decrease segment number, while genome 
duplications increase it. The phylogenetic trees that we pre- 
sent here for human and Drosophila give clear evidence of the 
birth-and-death evolution of ATP-binding domains in ABC 
transporters, and similar analyses of the yeast genes support 
this. In 5. cerevisiae, there are several G subfamily ABC genes 
(e.g., YOROllw and SNQ2) that have two NBF and TM do- 
mains (Michaelis and Berkower 1995; Decottignies and Gof- 
feau 1997). In contrast, all of the Drosophila and human ABCG 
family genes are half transporters. Therefore, it appears that 
the second ATP-binding domain in animal ABCG genes was 
lost in an ancestral lineage preceding the animal radiation but 
not the animal-fungi split. Similarly, ABCE genes in humans 
are one-domain genes, whereas both Drosophila and yeast 
have two-domain homologs (RLI1 in yeast and CG5651 in 
Drosophila). Therefore, the loss of the second domain appears 
to be a relatively recent event in human ABCE genes, probably 
in the vertebrate lineage. 

Perspectives 

One of the most fascinating findings from the analysis of the 
ABC genes from the human, worm, and fly genomes is the 
remarkable similarity in the number of ABC genes. Although 
additional sequencing and annotation will undoubtedly 
change the exact number somewhat, it is clear that humans 
do not have substantially more ABC genes than much simpler 
eukaryotes. This suggests that there is a core of essential ABC 
genes that are required for all multicellular eukaryotes. These 
higher eukaryotes do have about twice the number of ABC 
genes as does S. cerevisiae, suggesting that the evolution of 
multicellularity was accompanied by the expansion in the 
number of ABC genes. It is clear, however, that certain species 
have expanded some subfamilies more than others. As men- 
tioned above, flies have considerably more ABCG genes than 
humans (15 vs. five), whereas there are at least 12 ABCA genes 
in the human and only 10 in Drosophila. Despite the remark- 
able similarity in gene number, most Drosophila ABC genes do 
not have a clear ortholog in the human genome. The likely 
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exceptions are the ABCD, ABCE, and ABCF subfamilies that 
contain nearly identical gene numbers in the two species. The 
only ABC genes in the human and yeast genomes that are 
documented to have similar function are the ABCD (ALD-like) 
genes that are involved in the transport of very long chain 
fatty acids, and the ATM1/ABCB7 gene that is present in the 
mitochondria and has a role in iron metabolism. 

A major limitation to the understanding of ABC genes is 
the difficulty in obtaining crystal structures. The three- 
dimentional structure of two bacterial NBF proteins has been 
obtained and has greatly improved our understanding of the 
organization and function of this portion of the protein 
(Hung et al. 1998). It is clear that conformational changes in 
the NBF influence substrate binding and transport, and to 
date, this can only be addressed by laborious mutagenesis and 
biochemical experiments (Hafkemeyer et al. 1998), 

Ideally, the worm and fly ABC genes could be used to 
elucidate the function of their human counterparts. To date, 
however, we know considerably more about human ABC gene 
function that we do about either the Drosophila or Caenorhab- 
ditis elegans ABC genes. In fact, the only Drosophila genes ini- 
tially identified based on their function are the eye pigment 
genes. Similarly, only one ABC gene in C. elegans (ced-7) was 
identified based on function (involvement in cell death; Wu 
and Horvitz 1998). It is not obvious why more ABC genes are 
not associated with visible or detectable phenotypes in Dro- 
sophila or C. elegans as many of the human ABC genes affect 
morphology, behavior, lifespan, and fertility and these are 
traits that have been observed and selected for repeatedly in 
these organisms. These model organisms, however, do pro- 
vide an important resource for the future systematic study of 
ABC genes. Analysis of ABC gene expression combined with 
gene disruptions should yield important clues to gene func- 
tion. In addition, the capability of designing suppressor 
screens may allow the identification of pathways that involve 
ABC genes. 
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